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Abstract 



In the absence of Rec A- mediated cleavage of the repressor, the A 
prophage is exceptionally stable. In fact, the repressed state is then 
more stable than the gene encoding the repressor. We develop a math- 
ematical treatment that predicts the stability of such epigenetic states 
from affinities of the molecular components. We apply the model to 
the behavior of recently published mutants of Or and find that their 
observed stability indicates that the current view of the Or switch is 
incomplete. The approach described here should be generally appli- 
cable to the stability of expressed states. 

1 Introduction 

The lysogeny maintenance switch in phage A can be viewed as one of the sim- 
plest examples on the molecular level of computation, command and control 
in a living system. If, following infection of the bacterium Escherichia coli, 
the virus enters the lysogenic pathway, it represses its developmental func- 
tions, in response to a small set of sensory inputs, and integrates its DNA 
into the host chromosome. In this state the prophage may be passively repli- 
cated for many generations of E. coli. 

Established lysogeny is maintained by the protein CI which blocks opera- 
tors Or and Ol on the A DNA, and thereby prevents transcription of all 
lytic genes including cro (Ptashne, 1992). In lysogeny the CI concentration 
functions as a sensor of the state of the bacterium: if DNA is damaged the 
protease activity of RecA is activated, leading to degradation of CI. A low 
CI concentration allows for transcription of the lytic genes, starting with cro, 
the product of which is the protein Cro. 

According to the picture so beautifully conveyed by Ptashne (Ptashne, 1992), 
the A lysogeny maintenance switch is centered around operator Or, and con- 
sists of three binding sites OrI, Or2 and Or3, each of which can be occupied 
by either a Cro dimer or a CI dimer. As illustrated in Fig. 1 these three bind- 
ing sites control the activity of two promoters Prm and Pr for respectively cl 
and cro transcription. Transcription of cro starts at Pr, which partly over- 
laps OrI and Or2. Transcription of cl starts at Prm, which partly overlaps 



2 



Or2 and Or3. The affinity of RNA polymerase for the two promoters, and 
subsequent production of the two proteins, depends on the pattern of Cro 
and CI bound to the three operator sites and thereby establishes lysogeny 
with about 200 CI molecules per bacterium. If, however, CI concentration 
becomes sufficiently low, the increased production of Cro throws the switch 
to lysis. 

There does not seem to be a quantitative theoretical understanding of how 
stable the lysogenic state is to spontaneous switching. We will demonstrate 
that the large stability observed experimentally puts constraints on the mech- 
anism of the switch, constraints which we explore in a quantitative model. 
We will show that in order to span the timescales from single molecule on-off 
binding times of order about 10 sec, to the stability of order 10 12 sec, a simple 
model, where lysis occurs every time cro is transcribed, is inadequate. We 
elaborate a dynamical model that takes into account all known mechanisms 
of Or, including in particular nonspecific binding between Cro dimers and 
E. coli DNA. We can then reproduce the observed high stability. However, 
we will see that the recently reported robustness of the switch against muta- 
tional changes of the operators (Little et al, 1999) suggest that an additional 
mechanism of stability must be present. 

Bistable switches are expected to be important functional units in many sys- 
tems of control of gene expression (Monot & Jacob, 1961, Glass & Kauffman, 
1973). One general consequence of the mathematical model presented here 
is, that in living systems such switches can often only be of finite stability, on 
account of the small number of molecules involved. The model is hence also 
relevant to the stability of data storage in synthetic gene-regulatory networks 
with similar characteristics, in particular if intended for use in biotechnology 
or gene therapy (Gardner et al, 2000). 

2 Experiments on lysogen stability 

Repression of bacteriophage A can be eliminated either by cleavage of the 
repressor by the RecA protease (reviewed by Roberts & Devoret, 1983) or 
by the absence of repressor at the operators. The frequency of spontaneous 
induction in strains deleted for the recA gene has recently been reported. 
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Rozanov et al, (1998) found the frequency of infectious centers to be 1.1- 10 7 . 
Little et al, (1999) measured the ratio of released phage per lysogen and the 
burst size. They estimate the frequency of one cell to switch to the lytic state 
to be 4 • 10~ 7 per generation, assuming the switch occurred one hour before 
lysis. Little et al. verified the lysogens contained only a single prophage by 
PCR. Rozanov et al, (1998) do not report testing the number of prophages 
in the lysogens they used. We repeated these measurements using single lyso- 
gens (Powell et al, 1994) of MG1655 (Jensen, 1993) containing the del(srl- 
recA)306::TnlO allele (Csonka et al. 1979). We measured the frequency of 
switching to the lytic state by measuring the appearance of turbid plaques 
in a limiting dilution experiment at 37° in YT broth (Miller, 1972). We find 
that lysed cells to appear with frequency between 1.0 and 1.8 • 1CT 8 , which, 
if we include the same correction factor as (Little et al, 1999) to account for 
the increase in cell numbers during the period of phage development, implies 
a rate in the range of 3.5 • 10~ 8 and 6.0 • 10 -8 per generation and cell. Thus, 
all three sets of measurements fall within a factor of 10, in spite of the use 
of different strain backgrounds. 

Recent results of J. Little (Univ. of Arizona, personal communication) show 
that at 37°C about 99% of the turbid plaques are not wild type, but instead a 
marginally stable PRM240 mutant. As a consequence, Little estimates that 
wild type A phages inside E. coli cells grown on nutrient rich medium have a 
frequency of spontaneous lysis that does not exceed 2 ■ 10~ 9 per generation 
and cell. 

3 Elements of a model 

The considerations in this section are in part closely similar to the model of 
(Shea & Ackers, 1985). The main difference is that our interest is in longer 
time scales, and in the stability of the switch against spontaneous stochastic 
fluctuations. We will therefore begin with a discussion of characteristic time 
scales, then discuss chemistry, production and decay. The stochastic element 
is included in a definite model, see next section. 

Time scales: The measured half lives of CI and Cro dissociation from R 1 
at 22°C are respectively 34 sec for CI (Nelson & Sauer, 1985), and varies 



4 



from 1200 sec for Cro (Jana et al, 1998) dissociation from Or alone down 
to 20 sec (Kim et al. (1987)) from OR flanked by lkb of A-DNA. At equi- 
librium dissociation balances association, and thus the time it takes a CI or 
Cro dimer to bind to the operator is 

n = T off V [1M] exp(AG/RT) (1) 

where AG is the free energies of binding, which for CI and Cro are tabu- 
lated in Table [I], and RT = 0.617 kcal/mol. Assuming a bacterial volume 
of V = 2 ■ 10~ 15 Z (Bremmer et. al, 1996) we have V[1M) = 12 ■ 10 8 , and 
can then estimate the association rates of single molecules to be of order 
tens of seconds, for both CI and Cro. This is compatible with diffusion- 
limited association into a region of size L = 1 nm, at a rate determined 
by the Smoluchowski equation (Berg et al, 1982) t\ = V/(4ttDL), using 
a diffusion constant D = 10 _11 m 2 /s, similar to that of other proteins in 
bacterial cytoplasma (Elowitz et al, 1999). The association rate from N 
molecules is proportional to N, and thus for a bacterium with 10-200 CI and 
Cro molecules association events will occur at a frequency of fractions of a 
second, for both CI and Cro. 

The shortest time scale in which we are actually interested is that which 
it takes for a switching event to take place. This cannot be much less than 
those of significant changes in CI and Cro concentrations, which are of or- 
der one bacterial generation, 34 minutes in the strains used in (Little et al, 
1999). Hence, all binding- unbinding events of Cro and CI can be assumed 
to take place in homeostatic equilibrium. In addition, we need to character- 
ize RNA polymerase binding and initiation of transcription from promotor 
sites Prm and Pr. As discussed below (see Production) these events also 
have to occur on time scales much shorter than one bacterial generation, and 
can therefore be described by their overall rates. We note that a more de- 
tailed description of RNA polymerase action may be necessary to resolve the 
shorter timescales associated to the lysis- lysogeny entry decision, as discussed 
in modelling by McAdams, Arkin and Ross (1997,1998). 

Chemistry: We assume that CI and Cro molecules in the cell are in homeo- 
static equilibrium. This does not mean that there is always the same number 
of CI dimers bound to the operators at any particular time. These numbers 
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are fluctuating, and the equilibrium assumption gives the size of these fluctu- 
ations. The key inputs are CI and Cro dimerization constants and the Gibbs 
free energies for their bindings to the three operator sites OrI, Or2 and Or 3 
(Koblan & Ackers, (1992); Takeda et al, 1989, Kim et al. (1987), Jana et 
al. (1997,1998)), see Table 0. 

We remark that these free energies are taken from in vitro studies, and that 
the in vivo conditions could be different, e.g. the measured protein-DNA 
affinities could depend sensitively on the ions present in the buffer solutions. 
On the other hand, in vivo the effects of such changes should be compen- 
sated for, as e.g. changed KCl concentrations are by putrescine (Capp et 
al. (1996)) and other ions and crowding effects (Record et al. (1998)) The 
data quoted in Table [l] was obtained at KCl concentration of 200mM, which 
most closely resembles in vivo conditions. Parameter sensitivity is analysed 
separately below. 

Following Ackers et al. (1982), we encode a state s of CI and/or Cro bound 
to Or by three numbers (i, j, k) referring to respectively Or3 Or2 and OrI. 
The coding is if the corresponding site is free, 1 if the site is occupied by 
a CI dimer, and 2 if the site is occupied by a Cro dimer. The probability of 
a state s with i s CI dimers and j s Cro dimers bound to Or is in the grand 
canonical approach of Shea and Ackers (Shea & Ackers, 1985) 

P R ( S ) = AT" 1 [CI HCro Y* e - G{s)/T (2) 

The normalization constant A/" -1 is determined by summing over s. The 
free dimer concentrations ([CI ] and [Cro ]) are equal to the numbers of free 
dimers per volume, where the free dimer numbers are given by 

2n D = N -n M - 2J\f chr nu 

-2 • A/a i s Pr(s) + £ vP L («')j (3) 

where N is the total number of molecules of this type, A/a and M c hr the 
average number of copies present of the A genome and of the full E. coli 
chromosome, rijj the number of dimers bound nonspecifically to one chromo- 
some DNA and tim the number of free monomers. We assume free dimers to 
be in equilibrium with free monomers and with dimers bound nonspecifically 
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to DNA, such that um and njj can be expressed in terms of no, volume, 
and dimerization and association constants. In this paper we take M\ and 
M c hr both equal to 3 0. In our analysis we have mostly assumed that CI 
does not bind nonspecifically to DNA, while Cro does so with considerable 
affinity The effect of a putative nonspecific CI binding to DNA is analysed 
separately, see Table 1 and |. Note that we also include occupancy of Ol in 
order to account for finite depletion of CI, which has some significance for 
the fractional binding at Or when the total number of CI molecules is small. 
Parameters for Ol are listed in caption to Table 1. 

In Table [I] we have calculated V(s) = V(s \ Nci = 200, iVoo = 0). Of partic- 
ular interest for the stability are the states in which cro can be transcribed, 
i.e. (000), (100) and (200). The numbers in the table show that cro is al- 
ways transcribed at some rate in lysogeny, which if every cro transcription 
induced lysis, would destabilize the lysogenic state within a few generations. 
In Fig. 2a we show P(0, 0, 0) + P(l, 0, 0) + P(2, 0, 0) as function of N C i at 
Nero = 0. We remark that Fig. 2a corresponds to the scaling of Pr promoter 
strength in the in vitro experiment of Hawley & McClure (1983). The small 
gap in CI number before full activation of Prm at low CI in Fig. 2a originates 
in part from finite system filling of Or and Ol- 

Production: CI and Cro are produced from mRNA transcripts of cl and 
cro, which are initiated from promotor sites Prm and Pr. The rate of ini- 
tiation of transcription from P RM when stimulated by CI bound to Or2 is 
denoted Prm, and when not stimulated P RM - The number of CI molecules 
produced per transcript is S c i, and overall expected rate of production of CI 
is 

Overall CI rate = R RM S c i (P(010) + P(011) + P(012)) + 
KmSci(P(000) + P(001) + P(002) + 
P(020) + P(021) +P(022)) (4) 

For the relative values of the coefficients, Hawley & McClure (1982) report 
Prm to be equal to Prm/H- The absolute values are not known, and are 
likely to at least be proportional to the concentration of RNA polymerase in 
the cell. We discuss how we determine Prm below. According to (Shean & 
Gottesman, 1992) the number of CI molecules produced from one transcript 
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of cl is a factor 20 — 70 smaller compared to la.cZ. Thus S c j is small, between 
one and five, and we take S c i to be one. 

As there are about 200 CI in a lysogenic cell, and as each transcription only 
results in a small number of CI molecules, transcription has to be initiated 
from Prm many times per generation in lysogeny. We can therefore safely 
assume that the time scales for RNA polymerase to find the operator sites is 
also much less than one generation. This justifies our description of produc- 
tion by the overall rates, without distinguishing RNA polymerase binding, 
the rate of formation of open complexes, and possible temporary stalling of 
RNA polymerase at promotor sites. Fig. 2b shows average CI production 
according to (f|) as function of CI number in the cell, volume held constant. 
We note that the overall behaviour resembles the in vitro measurements of 
Hawley & McClure (1983). 

The rate of initiation of transcription from promoter Pr is denoted Pr and 
the number of Cro molecules per transcript S cto . From Ringquist et al, 1992 
one can estimate S cro to be 51% of ideal lacZ and (Kennel & Riezman, 1977) 
thus allow a lower estimate of S CIO to be 20. Thus, in contrast to the many 
small production events of CI , Cro production is intermittent. The ratio 
of Pr to Prm has been reported to be between 1.3 and 20, and depends 
strongly on RNA polymerase concentration in the cell (Hawley & McClure, 
1982). In practice we determine -Rrm by balancing CI production and de- 
cay, as discussed below, and use Pr as a free parameter. We end this part 
by stressing that -Rr, -Rrm, and P RM parametrize the total production rates 
from all copies of A DNA in the cell. The real rates of transcription per Or 
complex will generally be smaller, e.g. about a factor A/a ~ 3 smaller at 
high growth rates. 

Decay: Concentrations decay due to dilution and degradation. For CI we 
only take into account dilution from cell growth and division, while for Cro 
we include in addition an in vivo half life time t cro of about one hour at 37° 
(Pakula et al., 1986), which is significant compared to e.g. generation time 
of 34 minutes in the experiments of Little et al, 1999. 

The steady state: Balance between production and decay of CI over one 
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generation, where production is given by (H) and decay by -p 21 , gives one 

1 — v Hife 

equation for the parameter -Rrm, if the total numbers of CI and Cro in 
lysogeny are assumed known. Fig. 2b displays the steady state as the inter- 
section between the linear dilution and the Prm activity curve, with 200 CI 
and no Cro molecules in the system. 

The average number of Cro molecules in lysogens has not been measured, 
but in the models considered here this number is not zero. Equation (H) 
below gives the average rate of Cro production. If we assume that .Rr is 
not very different from -Rrm, as Shea & Ackers (1985), table fixes the rate 
of cro transcription in lysogeny to be once in five to ten generations, and 
if we assume -Rr to be an order of magnitude larger than -Rrm, then cro is 
transcribed on average once per generation. In both cases, this production is 
balanced by Cro decay to give an average number of Cro. Hence, the balance 
of only CI production and decay does not completely determine -Rrm, since 
the probabilities depend parametrically on the Cro concentrations, which in 
turn depend on .Rr. However, the implied dependence of -Rrm on _R R is not 
very pronounced. In the model presented below we fit -Rrm and -Rr to data, 
where the value of 0.115 for -Rrm is not very sensitive to changes in model 
assumptions. 

4 The model 

The basic time scale in our model is one bacterial generation, where we resolve 
shorter times around cro transcription events when necessary. A simulation 
goes as follows. Each bacterial generation starts with iVci total number of 
CI, and iVoo total number of Cro. From these one computes by equation (0) 
the occupation probabilities P s of the states s of CI and Cro dimers bound 
to Or, and from these the average CI and Cro production rates fci and fc TO - 
The first is computed by equation above, and the second by 

Overall Cro rate = S cro R R (P oo + Rioo + -P200) (5) 

CI production is assumed continuous, since S c i is about one. Cro produc- 
tion is on the other hand treated as discrete events. In each generation the 
bacterial volume V is assumed to start at V star t = ^V ave , and then to grow 
linearly until it has doubled. V ave = 2 ■ 10~ 15 / is the average volume over 
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one generation. The growth of bacterial DNA is treated similarly, with an 
average of 3 DNA chromosomes (~ 15 • 10 6 base pairs per cell). 

1) The rates fci and /c ro are computed with the current values of Nqi, Nc to 
and V. cro transcription events are assumed Poisson distributed with mean 
waiting time S CIO / f c ro - A random time t r is drawn from this distribution. If 
this time is less than the remaining time of the current bacterial generation, 
one cro transcription will occur, and we continue the simulation under point 
2b) below. Else, no more cro transcription will take place in this generation, 
and we continue under 2a). 

2a) The time since beginning of the generation, or since last cro transcription 
in the generation, is t. We update CI numbers by the expected number 
produced in the interval t, fci • t, plus £, a random number drawn from a 
Gaussian distribution with zero mean and variance fci • t. We update Cro 
number by removing each Cro molecule with a probability p = p(t) = 2~*/* Cro , 
and then proceed to cell division under point 3) below. 

2b) We update CI number and degrade Cro number as above, 2a), but under 
time t r . The number of Cro molecules produced from this cro transcription 
is a random variable drawn from a Poisson distribution of mean S crcn and is 
added to the remaining Cro. We now return back to point 1) above for the 
remaining time of the current generation. 

3) This is the stage of cell division. The volume of a daughter cell is restarted 
at V s tarti and will contain each CI and Cro molecule of the mother cell with 
probability one half. If CI number has shrunk below a threshold, a switch to 
the lytic pathway has occurred, and we leave the loop. Otherwise we return 
to 1) with the new values of CI and Cro. 

The expected number of generations before lysis is calculated by an average 
over at least 10 independent trajectories. The value of the threshold in CI 
number is unimportant, provided it is chosen sufficiently low. In the model 
presented here lysis practically always follows if N C i goes below 20. 
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5 Results 



Standard parameter behaviour: The two parameters -Rrm and .Rr have 
been adjusted such that the average number of CI molecules in lysogeny is 
200, and the mean rate of spontaneous lysis is ~ 2 • 10~ 9 generations. The 
other parameters values are discussed above in section §. The values thus 
obtained are -Rrm = 0.115sec _1 and i?R = O.SOsec" 1 . The ratio between 
these two rates is hence about three, well within the limits set by Hawley & 
McClure (1982) but larger than the ratio 1.3 used in the model by (Shea & 
Ackers, 1985). 

For these parameters we show in Fig. 3 the last few hundred generations 
before lysis in a typical simulation. The numbers of CI and Cro fluctuate for 
a long time around the metastable lysogenic state. A particularly large fluc- 
tuation (of both CI and Cro) then builds up over a few generations, leading 
to lysis. This kind of switching event is similar to the escape over an activa- 
tion barrier as in the well-known diffusion model of chemical reactions. Thus, 
spontaneous lysis is the result of a number of unlikely events happening in 
a given short time interval. If each event has probability p < 1, and lysis 
depends on n events, the resulting frequency of lysis is ~ p n . This frequency 
is therefore very sensitive to p, that means to parameters in the model. 

In Fig. 4 we explore the activation barrier picture in a more detailed way 
by computing the probability that the system over time visits various values 
of A CI and Ac ro - The distribution has a maximum around the lysogenic state, 
as it should. From there to the origin, along the Aci axis, the probability 
becomes almost vanishingly small, which reflects the fact that a fluctuation 
of CI alone, without Cro, cannot induce lysis. The switch instead happens 
along a ridge in the (A CI , A Cro ) plane, where probability does not go down 
so quickly. In Fig. 4b we examine the tail of the distribution in Fig. 4a. One 
observes a saddle for high Cro number and Nci about 20 — 25. For the pa- 
rameter values used here, lysis follows with 50% probability, as soon as Aci 
goes below 23. It is natural to assume that lysogeny is separated from lysis 
by a basin boundary (a separatrix), which runs close to parallel to the Ac ro 
axis around the saddle point. The value of Cro at the saddle can be varied 
from about a hundred to well over a thousand in the model, and depends 
sensitively on the numerical values of Cro affinities to specific and nonspecific 
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DNA. 



Parameter sensitivity: A main objective to build a model for stability 
is to address which key quantities that determines the stability. As we have 
already discussed, stability is governed by near simultaneous occurrences of 
many rare events. Consequently, the stability will be very sensitive to the 
probability of these events, which in turn depend sensitively on parameters, 
in particular of the parameters describing Cro production in the cell. We now 
examine parameter sensitivity, i.e. lysis frequency as function of production 
rates and binding affinities. The least uncertain combination is the product 
Sd ■ Rrm, which is to good accuracy determined by the number of CI in 
lysogeny, known to be between 180 and 350 for growth in rich medium. The 
individual terms in this product are less well known, but only influence the 
dynamics through the relative amplitude of noise in CI production, which 
will be proportional to the square root of 5 c j, all else equal. This uncertainty 
is much less than what stems from Cro, as we describe next, and will there- 
fore be ignored. 

In Table ^] the top row refers to a reference set-up, and the three follow- 
ing rows show the effects of varying, subsequently, Cro degradation rate t^, 
Pr, and finally Pr and S CTO keeping the product Pr • S CTO fixed. Thus, with 
given binding affinities the mean rate of switching increases with any of S CTO , 
Pr and tcro, but by varying two of these parameters in opposite directions, 
one may also reproduce the measured stability. Varying different binding 
affinities we find that stability primarily is determined by the free energy 
difference between Cro binding to Or3 and Cro binding to nonspecific DNA. 
In Fig. 5 we show the dependence of stability with this difference, for three 
different values of -Rr. At low lysis frequency, a four-fold change in Pr cor- 
responds to a little less than a PT hi 4 change in binding energy difference 
(approximately equal to 0.9kcal/mol) Changes in Cro affinities to OrI and 
Or2 do not influence stability significantly. 

CI influences the stability of a lysogen by binding to OrI and Or2, which de- 
termines the fraction of time Pr is open (see Fig. 1.). Increasing for instance 
binding to these sites by 0.5kcal/mol means that the probability that Pr is 
open is reduced from 0.000071 to 0.000013, a five-fold weaker activation of 
Pr. CI also indirectly influences the stability of a lysogen through the bind- 
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ing to 0r3, which determines the fraction of time Prm is open. To balance 
decay through dilution at given total number of CI, such a change must be 
compensated by an increase in the product PrmSW- An increase in this must 
then be matched by a change in Pr to maintain the measured stability. Thus 
any CI affinity increase can be compensated by a corresponding increase in 
Pr- 

The conclusion so far is that with present parameter information the model is 
compatible with observed rate of spontaneous lysis in recA— E. coli strains, 
but we have to look elsewhere for a more stringent test, given the present 
lack of knowledge of especially parameters related to Cro. 

The Little mutants: The last section of the Table shows the mean rate 
of switching in variants of the model corresponding to the recently reported 
lysogenic state in A mutants, where either operator R 1 is replaced by a 
copy of Or3, or vice versa (Little et ai, 1999). Thus while wild type (321) 
has an Or site made up of 0r3Or2Or1, the mutant labelled mutant 121 
has 0r10r20r1 at Or, and mutant 323 Or30r20r3. Varying parameters 
related to Cro and promoter strengths we find it impossible to reproduce 
simultaneously the measured stability of wild type recA— lysogens and the 
existence of stable lysogens in these mutants. For 323 the cause of decreased 
stablity is demonstrated in Fig. 2a, which shows that Pr is activated much 
more than in wild type. For 121, on the other hand, the lack of stability is 
reflected in a much lower CI level in lysogens, as demonstrated in Fig 2b. 
Thus the two mutants demonstrate two mechanisms of destabilization, one 
through enhanced Cro, the other through depleted CI. 

When adjusting Pr down by a factor 10 we can obtain stable lysogens for 
121, and if by a factor 100, then also for 323. In both cases the stability 
of wild-type increases enormously, and does not match the supposed lysis 
frequency of 2 • 10~ 9 per cell and generation. These parameter changes allow 
us however to analyse what lysogens in the mutants would look like, as in 
the last six lines of Table 2. We see that the number of CI for 121 is 15-20% 
of wild- type, while for 323 the number is 50-60%. Both of these numbers are 
similar to the CI levels reported by Little et al. (1999). These CI levels can 
also be found by balancing production and decay of CI only, as in Fig. 2b. 
In further qualitative agreement with experiment we observe that in spite of 
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having more CI, the 121 mutant is markedly more stable than 323. This is 
mainly caused by the much larger probability for open Pr in 323, as seen 
from Fig. 2a. 

Finally we report that the above conclusions remain valid also if CI binds 
significantly to nonspecific DNA, see appendix: In that case we may simul- 
taneously fit stability of wild-type and 121, but not wild-type and 323. 

6 Discussion 

The switch to lysis in our model is essentially a first exit time problem, in a 
system influenced by a combination of deterministic and stochastic forces. A 
well-known analogy is thermal escape of a particle from a potential (Kramers, 
(1940); Hanggi et ai, 1990), a model of chemical reactions with activation 
energies. There the rate of escape depends exponentially on the height of 
the potential barrier, and is therefore very sensitive to small changes in the 
potential. Our case is not exactly in Kramers' form, but the analogy is nev- 
ertheless illuminating. We also find that the rate of spontaneous switching 
depends very sensitively on model parameters. In fact, the negative of the 
logarithm of the probability from Fig. 4 can be identified with the Wentzel- 
Freidlin quasi-potential (Freidlin & Wentzell, 1984), (Maier & Stein, 1997), 
which plays the same role in this more general exit time problem as the 
potential in Kramers' problem. An analogy to temperature is on the other 
hand the noises associated to typical fluctuations of CI and Cro numbers in 
a lysogen. We intend to return to these questions in a future publication. 

The large stability of the lysogenic state of the A puts constraints on working 
mechanisms of the switch. First, we have shown that a simple model, where 
every transcription of cro leads to lysis, only provides stability for a few gener- 
ations. A mechanism must therefore exist which stabilizes the switch against 
most transcriptions of cro. We have explored a straight-forward model, where 
lysogens are stabilized by a spontaneous lysis that require a number of cro 
translations within each of 3-5 subsequent bacteria generations. Thereby the 
small stability arising from a single cro translation is raised to a high power, 
implying a very rare spontaneous breakdown of lysogeny. The need for mul- 
tiple cro translations for break down of lysogen stability is associated to the 
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following mechanisms: (i) Cro bound to operators is in homeostatic equilib- 
rium, and CI production is therefore possible with Cro present; (ii) Cro is 
degraded and diluted over time; (iii) Cro binds also to nonspecific DNA, with 
significant affinity. Measured lysogenic stability of wild-type recA— E. coli 
can then be reproduced in the model. 

The actual stability depends sensitively on the parameters in the model, 
and we thus predict a stability which changes dramatically with changes in 
e.g. growth conditions. The model allows for quantitative examination of 
mutants, by changing parameters in the model, and we have in this sense 
investigated the Or mutants reported by Little et al, 1999. Although the 
model can reproduce the ratios of CI numbers in mutant and wild-type lyso- 
gens, and of the stabilities of each one of wild-type, 121 and 323 mutants, it 
fails for all parameter values to reproduce simultaneously all of the stabili- 
ties. The discrepancy is largest between wild type and 323. This suggests 
that some additional mechanism outside the Or complex contributes to the 
stability. Origins of such a mechanism could be either an increased repression 
of Cro production by CI relative to our model, or, conversely, a decreased 
repression of CI production by Cro. Examples of such mechanisms are: i) 
additional cl transcription directed from Pre; ii) an unknown interaction 
between Cro and CI that allows Cro to stabilize lysogens; iii) CI mediated 
binding between Ol and Or if that would significantly repress Pr; iv) that 
Cro is unable to block Prm completely, even when bound to Or3. 

The first mechanism i) would give a second role to Pre in repressor main- 
tenance, while mechanism ii) was suggested already by (Eisen et al, 1982) 
based on a study of the Hyp phenotype. An argument for mechanism iii) is 
the recent report by Revet et al (1999), which shows a fourfold increase in 
repression of Pr in a plasmid construction involving two operator complexes, 
both of which can bind CI dimers, and one of which overlaps with Pr as in 
wild- type A Or. These mechanisms are of course not exclusive. 

In summary, we have developed a quantitative model for the stability of lyso- 
gens. The model builds a direct connection between processes, their affinities 
and the resulting stability. The model allows for quantitative numerical tests 
of genetic feed-back mechanisms on the molecular level, and points to pos- 
sible shortcomings in the standard model of A lysogeny maintenance switch. 
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Finally, we wish to point out again that the model developed here speaks 
directly to the stability of data storage in biocomputing (Gardner, et al. 
2000). 

Appendix: Consequences of nonspecific CI bind- 
ing 

(Koblan et al, 1992) report that binding per base pair between CI dimers and 
nonspecific DNA is at least 9kcal/mol weaker than binding between CI and 
Or3. This sets a limit on nonspecific binding to be at most — 3.5kcal/(mol- bp). 
Here we consider the effects of a — 3.0kcal/(mol- bp) nonspecific binding be- 
tween CI dimers and DNA, see also Table 1. The 2- 10 -9 stability of lysogens 
with iVci = 200 is then obtained with i? RM = 0.085/sec and i?R = 0.02/sec. 
In this state Or3 is 20% occupied in lysogeny, in agreement with (Maurer et 
al, 1980). When we also apply these parameters to the 121 and 323 mutants 
we find that 121 can form stable lysogens (with N C i(l21) m 40 and lysis 
frequency in the range 10~ 3 to 10" 5 ), whereas 323 fails to do so. This lack 
of stability of 323 in the model fails to reproduce the stability observed by 
(Little et al, 1999). 
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Figure 1: Right operator complex, Or, consisting of the three operators 
OrI , Or2 and Or3. cl is transcribed when Or3 is free and Or2 is occupied 
by CI. cro is transcribed when both Or2 and OrI is free. CI dimers bind 
cooperatively to OrI and Or2. 



Figure 2: a) Probability of open cro promoter, P(-00) = P(000) + P(100) + 
P(200), as function of CI number in E.coli cell of volume 2 • /im 3 . The 
dashed lines marked 121 and 323 show repression for the two Or mutants 
investigated by J. Little (Little et al, 1999). b) Activity of Prm in units of 
activity when Or3 is free and Or2 is occupied by CI. Same notation as in 
a). For relative activity of nonstimulated Prm promoter we use the data of 
(Hawley & McClure, 1982). 
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Figure 3: Time course of last few hundreds of generations before lysis in a 
typical simulation with the standard set of parameters. The average switch- 
ing rate is 1.4 • 10~ 9 per cell and generation. The real level of Cro in a 
lysogenic cell is unknown. In our model, almost all of the Cro present is 
bound nonspecifically to DNA in lysogeny, the total level could be changed 
by at least a factor 10 by changing the (uncertain) difference in Cro affinity 
to Or3 and to nonspecific DNA. 



Figure 4: a) Probability to be at various values of CI and Cro number in 
the cell during lysogeny. Due to finite number of samples, one cannot see the 
rare fluctuations leading to lysis, b) As in a), but limited to the rare events 
where N Cro > 50. 



Figure 5: Frequency of spontaneous lysis (/) as function of the affinity ratio 
r = exp(AAG/i?T) between specific and nonspecific Cro binding. The full 
line is for standard setup (e.g. R R = 0.30/s), and one observe a large relative 
change in / with r, implying that the adjustment in bindings that leads 
from high to very high stability is small. The +) show effect of increased 
nonspecific Cro - DNA binding by lkcal/(mol-bp), and its similarity to the 
full line marks that no free Cro are present. The dashed lines show lysis 
frequency with respectively -Rr = 1.2/s and -Rr = 0.075/s. 
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AGqi-dna — 


AGci-DNA — 






0.0kcal/(mol- bp) 


— 3.0kcal/(mol- bp) 


State s 


AG [kcal/mol]. 


P(s,N=200) 


P(s,N=200) 


(000) 


0.0 


0.00005 


0.00063 


(001) 


-12.5 


0.00301 


0.01269 


(010) 


-10.5 


0.00012 


0.00050 


(100) 


-9.5 


0.00002 


0.00010 


(Oil) 


-25.7 


0.59291 


0.80189 


(101) 


-22.0 


0.00147 


0.00199 


(110) 


-22.9 


0.00632 


0.00855 


(111) 


-35.4 


0.39606 


0.17366 


CI dimer 


-11.1 






(002) 


-14.4 






(020) 


-13.1 






(200) 


-15.5 






Cro dimer 


-7.0 






AGcro-DNA 


— 6.5/bp 







Table 1: CI and Cro dimer affinities to OrI, Or2 and Or3 after, respec- 
tively, (Koblan & Ackers, 1992), and (Takeda et al, 1989, 1992), (Jana et al. 
1997, 1998) and (Kim et al. 1987). A major source of uncertainty of in vivo 
Cro bindings is that all in vitro experiments for Cro have been performed at 
or below room temperature. The CI dimerization is from (Koblan & Ackers 
1991), Cro dimerization from (Jana et al. 1997), and nonspecific Cro dimer 
binding to DNA from (Kim et al., 1987). The third and fourth columns 
display the probabilities of different states of binding of CI to the sites in 
Or, with different assumed nonspecific affinity of CI to DNA. For binding 
of single CI dimers to left operators the relative affinities compared to OrI 
(AG R (0, 0, 1)) are AAG L (s) = {1.0, 0.8, -0.2} in units of kcal/mol, where s 
are the states (0,0, 1), (0, 1,0) and (1,0,0), respectively. The relative affini- 
ties of binding of single Cro dimers are similarly AAGl(s) = {—0.1, 0.5, 1.0}, 
where the reference energy is AG R (0,0,2) (Takeda et al). 
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Or 


Rr 


<7 

'-'cro 


^Cro 


CI 


Cro 


lysis frequency 


type 




number 


M 


number 


number 


[generations] -1 


321 


0.30 


20 


3600 


200 


0.8 


1.4- 10~ 9 


321 


0.30 


20 


7200 


200 


0.8 


1.7- 10~ 8 


321 


0.60 


20 


3600 


200 


1.4 


3- 10~ 7 


321 


0.30 


40 


3600 


200 


1.6 


2- 10" 5 


121 


0.30 


20 


3600 






> 0.1 


323 


0.30 


20 


3600 






> 0.1 


121 


0.030 


20 


3600 


32 


8 


1.3 • 10~ 6 


323 


0.030 


20 


3600 






~ 0.1 


121* 


0.030 


20 


3600 


32 


8 


4- 10- 6 


323* 


0.030 


20 


3600 


85 


34 


0.04 


121 


0.0030 


20 


3600 


33 


1 


5- 10- 7 


323 


0.0030 


20 


3600 


122 


2 


1.5 • 10~ 6 



Table 2: Switching stabilities and CI and Cro numbers as function of model 
parameters, for wild type and the Little mutants. Top row is the standard 
configuration, as described in the main text. Second through fourth row 
explores the changes in lysis frequency and average number of Cro present 
in wild type by varying R R , S cro and t Cro . Fifth to twelfth row show the 
changes in Little 121 and 323 mutants upon changing i? R . The rows marked 
121* and 323* explore an additional change of -R R m> the rate of unstimulated 
transcription from P RM to -R RM = 0.30-Rrm- The standard parameter value 
(without *) is i? RM = .Rrm/11- We observe that the 323 mutants fail to sta- 
bilize unless i? R is decreased a hundredfold. The stability of 121 is increased 
a million times (compare row 5 and 11), as is also wild-type. 
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